Filtering the UMLS ® Metathesaurus ® for MetaMap 2010

Author

  • Alan R. Aronson
Abstract

The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings), which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. The 2010AA version of the Metathesaurus, for example, includes 5,394,495 English strings, 5,338,590 (98.96%) of them distinct, representing 2,194,659 concepts. There are 2.20% more English strings and 3.51% more concepts than in the 2009AA edition. Many of the strings in the Metathesaurus are of little value to MetaMap for one of four reasons:

1. Some strings are virtually indistinguishable from each other; for efficiency, only one representative of a set of indistinguishable strings is needed.
2. Some strings represent general, nonmedical concepts, are unnecessarily ambiguous, or have been found to be problematic for some other reason.
3. Some strings have an assigned type in their vocabulary because they have a form (e.g., an idiosyncratic abbreviation) that is highly unlikely to appear in regular text.
4. Some strings, including lengthy descriptions of things such as procedures, health activities or medical devices, are so complicated that they too are unlikely to appear in normal text.
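As a rough illustration of reasons 1 and 4, the sketch below keeps one representative per group of near-identical strings and drops very long strings. It is not MetaMap's actual filtering procedure: the normalization rule (lowercasing, punctuation treated as spaces), the word-count cutoff, and the names `normalize`, `filter_strings` and `max_words` are all assumptions made for this example.

```python
import re
from typing import Iterable, List


def normalize(s: str) -> str:
    """Hypothetical normalization: lowercase, turn punctuation into spaces,
    and collapse runs of whitespace. MetaMap's real rules may differ."""
    s = re.sub(r"[^\w\s]", " ", s.lower())
    return " ".join(s.split())


def filter_strings(strings: Iterable[str], max_words: int = 6) -> List[str]:
    """Keep the first string of each normalized group (reason 1) and
    drop strings longer than max_words words (reason 4).
    The cutoff of 6 words is an arbitrary, illustrative choice."""
    seen = set()
    kept = []
    for s in strings:
        key = normalize(s)
        if not key or key in seen:
            continue  # empty, or indistinguishable from a string already kept
        if len(key.split()) > max_words:
            continue  # too long to be likely in running text
        seen.add(key)
        kept.append(s)
    return kept


# Made-up example strings, not taken from the Metathesaurus.
examples = [
    "Heart Attack",
    "heart attack",
    "Heart-attack",
    "Insertion of catheter into the left subclavian vein under ultrasound guidance",
    "Myocardial infarction",
]
print(filter_strings(examples))
```

Here the three spellings of "heart attack" collapse to a single representative and the long procedure description is dropped. Reasons 2 and 3 depend on curated lists and vocabulary-specific term types, so they are not modeled in this sketch.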

Similar resources

Filtering the UMLS ® Metathesaurus ® for MetaMap 2012 Edition

The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2011AA version of the Metathesaurus...

Filtering the UMLS® Metathesaurus® for MetaMap 1999 Edition

MetaMap’s primary purpose is to provide a basis for further processing of biomedical text by finding the Metathesaurus concepts referred to in the text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has su...

Filtering the UMLS ® Metathesaurus ® for MetaMap 2009

The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2009AA version of the Metathesaurus...

Filtering the UMLS ® Metathesaurus ® for MetaMap 2011 Edition

The MetaMap program’s purpose is to discover the Metathesaurus concepts referred to in arbitrary text. A given Metathesaurus concept can have many alternative names (Metathesaurus strings) which originate in the many source vocabularies included in the Metathesaurus. As the number of strings has grown over the years, MetaMap’s performance has suffered. In the 2011AA version of the Metathesaurus...

Improving Summarization of Biomedical Documents Using Word Sense Disambiguation

We describe a concept-based summarization system for biomedical documents and show that its performance can be improved using Word Sense Disambiguation. The system represents the documents as graphs formed from concepts and relations from the UMLS. A degree-based clustering algorithm is applied to these graphs to discover different themes or topics within the document. To create the graphs, the...

Publication date: 1991